Comparison of Feature Usage at TSC-3 Summarization Tasks

نویسندگان

  • Chikashi Nobata
  • Satoshi Sekine
  • Kiyotaka Uchimoto
  • Hitoshi Isahara
چکیده

We participated in two summarization tasks at the TSC-3. We have introduced categorization of feature values for our summarization system, which is based on sentence extraction technique. The categorized values were used as features for generating a decision tree. We compared our summarization system using the categorization of feature values with the one using linear combination of features in evaluation results of the TSC-3 tasks.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Kernel-based Approach for Automatic Evaluation of Natural Language Generation Technologies: Application to Automatic Summarization

In order to promote the study of automatic summarization and translation, we need an accurate automatic evaluation method that is close to human evaluation. In this paper, we present an evaluation method that is based on convolution kernels that measure the similarities between texts considering their substructures. We conducted an experiment using automatic summarization evaluation data develo...

متن کامل

Phrase-representation Summarization Method and Its Evaluation

We have developed a summarization method that creates a summary suitable for the process of sifting information retrieval results. Unlike conventional methods that extract important sentences, this method constructs short phrases to reduce the burden of reading long sentences. We developed an improved task-based evaluation method and applied to prove the effectiveness of phrase-represented summ...

متن کامل

NTT/NAIST's Text Summarization Systems for TSC-2

In this paper, we describe the following two approaches to summarization: (1) only sentence extraction, (2) sentence extraction + bunsetsu elimination. For both approaches, we use the machine learning algorithm called Support Vector Machines. We participated in both Task-A (single-document summarization task) and Task-B (multi-document summarization task) of TSC-2.

متن کامل

Elimination of Multiple Modifiers in Summarization

We propose a method that summarizes a Japanese sentence. The method aims to produce a natural and readable summary from the sentence. This method eliminates a part of multiple adnominal modifiers including adnominal clauses by employing natural language processing tools: KNP (a parser), and JUMAN (a morphological analyzer). With this proposed method, we participated in subtask A-2 (for producin...

متن کامل

Summarization as Feature Selection for Document Categorization on Small Datasets

Most common feature selection techniques for document categorization are supervised and require lots of training data in order to accurately capture the descriptive and discriminative information from the defined categories. Considering that training sets are extremely small in many classification tasks, in this paper we explore the use of unsupervised extractive summarization as a feature sele...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004